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BACKGROUND OF THE INVENTION 
Field of the Invention 

This invention relates generally to electronic and 
computer synthesis of sounds. More particularly, it re- 
lates to devices and methods for the synthesis of complex 
ambient background and foreground impact sounds that 
sound neither repetitive nor looped. 

Description of the Related Art 

Many computer-implemented games and simulations con- 
tain background and foreground sounds that help create a 
highly realistic environment. These complex sounds are 
often generated using some version of a wavetable synthe- 
sis algorithm. A wavetable is a table of stored sound 
waves, typically stored in read-only memory on a sound 
card chip, that are digitized samples of actual recorded 
sound. Complex sounds are generated by combining and 
modifying the stored sound waves. One of the primary 
techniques used by wavetable synthesizers to conserve 
sample memory space is the looping of sampled sound seg- 
ments. For example, acoustic string instrument sounds can 
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be modeled as attack and sustain portions, and the sus- 
tain portion in particular can be reproduced with a re- 
peated sound sample multiplied by a continually decreas- 
ing gain factor. In order to generate and complete in- 
5 strument sound, only a relatively small sample must be 

stored. 

While wavetable synthesis techniques have proven 
very successful in generating musical instrument sounds, 
they are inadequate for synthesizing realistic game 
10 sounds, which are typically large-scale sound events made 

O up of collections of smaller, simpler sound events. Exara- 

O 

15 pies of game sounds include ambient background sounds 

C3 such as forest or crowd noises and complex foreground 

fli 

sounds such as car crashes or explosions. With standard 
W 15 wavetable synthesis, background sounds such as crowds 

1,1. sound looped, while complex impact sounds are repetitive, 

Q i.e., sound identical each time they occur. Repetitive or 

[q looped sounds begin to sound unnatural very quickly, dra- 

C3 matically reducing the realism conveyed by the game. 

20 Clearly, there is a need for improved techniques for gen- 

erating realistic game sounds for computer simulators and 
games , 

SUMMARY OF THE INVENTION 

25 This invention generates more realistic complex 

sounds for computer simulators, games and the like by 
generating multiple different kinds of simpler sounds 
with repetition rates that vary in accordance with random 
time distributions. The average rate of generating the 

30 simpler sounds can also be made variable, in accordance 

with either user inputs or a predetermined function. 
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Various sound parameters, such as wave selection, pitch 
distribution, pan distribution and amplitude distribu- 
tion, can themselves be established with random value 
distributions that in turn are functions of inputs such 
5 as mean, standard deviation and minimum and maximum val- 

ues, at least some of which have their own random distri- 
butions . 

BRIEF DESCRIPTION OF THE DRAWINGS 
10 FIG. 1 is a block diagram of an event modeling sys- 

J,^ tern of the present invention. 

C3 FIG. 2 is a block diagram of a trigger process that 

fj has both an intensity envelope and a constant intensity 

01 as inputs . 

1^1 15 FIGS. 3A-3B are block diagrams of first and second 

= embodiments, respectively, of the trigger process of the 

f% system of FIG. 1. 

'^Z FIG. 4 is a block diagram of exemplary parameter 

p selectors of the system of FIG. 1. 

^* 20 FIGS. 5A-5B are block diagrams illustrating two im- 

plementations of varying the parameter selectors with the 
intensity envelope. 

FIG. 6 is a block diagram of a nested trigger proc- 
ess of the present invention. 

25 

DETAILED DESCRIPTION OF THE INVENTION 

Although the following detailed description contains 
many specifics for the purposes of illustration, anyone 
30. of ordinary skill in the art will appreciate that many 

variations and alterations to the following details are 



within the scope of the invention. Accordingly, the fol- 
lowing preferred embodiment of the invention is set forth 
without any loss of generality to, and without imposing 
limitations upon, the present invention. 

The present invention solves the above described 
problem by extending the wavetable concept to a statisti- 
cal sound event modeler that effectively creates sounds 
that sound neither looped nor repetitive. Typical appli- 
cations include computer games, multimedia exhibits, and 
music synthesis. The invention is particularly well 
suited for large-scale sound events that are collections 
of smaller, simpler sound events. For example, a car 
crashing into a wall is a large-scale sound event made up 
of individual crunch sounds, sounds of various objects 
falling on the ground, and different scrape sounds, among 
others. When generated by the method of the present in- 
vention, a car crash sound can be repeated many times 
while sounding different each time. 

The sound generation tool of the present invention 
provides the following advantages : 

• Low CPU usage 

• Scaleable in CPU usage 

• Useful for generating both complex continuous sounds 
and complex impact sounds 

• Has a more general usefulness than a specific physical 
model 

• Can be understood and learned easily by sound designers 

• Allows sound designers to use their own wave files 

The techniques of the present invention may be im- 
plemented in the form of instructions stored in a memory 
and executed by a general purpose microprocessor present 



in a desktop computer, laptop computer, viedo arcade 
game, and the like. The techniques of the present inven- 
tion may also be implemented in hardware, i.e., using an 
ASIC that is part of a computer system. The synthesized 
signals from the microprocessor or ASIC are output to a 
user using an audio sound system that is either internal 
to the system or part of an external sound system con- 
nected to the computer system. The hardware preferably 
includes conventional state-of-the-art components well 
known in the art. Because the primary distinguishing fea- 
tures of the present invention relate to the specific 
synthesis techniques, the following description will fo- 
cus on these techniques . 

FIG. 1 is a block diagram illustrating the three 
main components of the method. A trigger process gener- 
ates a collection of events distributed through time, 
with the average rate of event generation controlled by 
one or more intensity parameters. Next, parameter selec- 
tors choose the values of various parameters according to 
statistical distributions that themselves have controlla- 
ble input parameters. Finally, a playback engine plays 
waves according to the selected parameters when triggered 
by the trigger process. 

The trigger process selects a random time lag be- 
tween subsequent events that make up the large-scale or 
complex events. The intensity is a parameter, typically 
chosen by a user, that determines the average rate of 
generation of simple events by the trigger process. That 
is, the intensity is directly related to the mean of the 
probability density function of the time between single 
events. When constructing a complex sound, the user se- 



lects either a constant intensity or an intensity that 
changes with time; a changing intensity is referred to as 
an intensity envelope. For example, ambient sound such as 
cricket chirps are typically generated at a constant av- 
erage rate. That is, while the time between individual 
chirps fluctuates randomly to provide a natural environ- 
ment, the average time between chirps is constant over a 
large time period. In contrast, impact sounds such as car 
crashes, explosions, or dog barks are composed of indi- 
vidual sound events whose rate of generation is prefera- 
bly not constant over the duration of the sound. For ex- 
ample, a car crash can begin with rapidly generated 
crunch sounds and gradually trail off into more slowly 
generated glass breaking sounds. After some experimenta- 
tion, a user can determine the correct intensity envelope 
to achieve the desired sound. In FIG. 1, the intensity is 
shown as a time-varying envelope. 

Combinations of intensity parameters are also possi- 
ble. FIG. 2 illustrates a method in which an intensity 
envelope is combined with a constant intensity. The rate 
at which the trigger process generates events is deter- 
mined by the sum of the constant intensity and the value 
of the intensity envelope at a particular time. Such an 
intensity combination is useful for background sounds 
that occasionally become more prominent, such as a crowd 
sound with occasional loud cheers or boos from the crowd. 
The superimposed intensity envelope can also be deter- 
mined from other factors. For example, a loud crowd cheer 
can be triggered when a particular game event, such a 
goal being scored, occurs. In this case, the intensity 



envelope is added to the constant intensity only after 
the game event occurs . 

There are two main embodiments of the trigger proc- 
ess, both of which are characterized by a particular sta- 
tistical distribution of the time between individual 
events. In the embodiment of FIG. 3A, the trigger process 
samples white noise and generates events when a strongly 
lowpass-f iltered noise signal crosses zero in an upward- 
going direction. For example, the filter can be imple- 
mented as two cascaded onepole filters (or as an equiva- 
lent second-order filter) : 

7^(z) = [{l + a.)(l + aO]/[(H-a,z-')(l + a.^-)]. 

The rate of stochastic events is determined by the 
filter bandwidth, which is obtained from the user- 
selected (or system-selected) parameter according to the 
following equation: 

where R^^ is the desired average rate of events (in 
events/second), and is the calculation rate (i.e., 

sampling rate) of the filter or filters. A smaller band- 
width decreases the average rate of event generation, 
i.e., increases the time between individual events. Using 
this method, the time distribution of event generation is 
described by the following normalized probability density 
function p(At) of the time between events: 



p{At) = ceAte-^, 



where a is a normalization constant. This embodiment has 
been shown to work well for a wide variety of complex 
sounds . 

An alternative embodiment of the trigger process is 
illustrated in FIG. 3B. In this embodiment, event genera- 
tion is based directly on predefined random distribution. 
After an event is generated, a random generator selects a 
value of the time delay. At , until the next event should 
be generated. After the selected time delay passes, a new 
event is generated. This new event then triggers the ran- 
dom generator to select a time delay for the next event 
according to the predefined random distribution. FIG. 3B 
illustrates the time delay being generated after each 
event. Alternatively, the entire sequence of event delays 
can be generated first, and then the events triggered 
according to the sequence . 

The random generator can use any suitable random 
distribution. Some examples of suitable distributions can 
be found in Charles Dodge and Thomas a. Jerse, Computer 
Music: Synthesis, Composition, and Performance, New York: 
Schirmer Books, 1985, herein incorporated by reference. 
Csound, a programming language for software-only synthe- 
sis and processing of sound, uses a variety of distribu- 
tions that may also be used in the trigger process of the 
present invention: linrand, trirand, exprand, bexprend, 
cauchy, pcauch, poisson, gauss, weibull, beta, and uni- 
form. Alternatively, arbitrary user-defined distributions 
may be used. In all cases, it is necessary to determine 
the relationship between the user-defined intensity and 
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some parameter of the distribution, i.e., the value of 
the distribution parameter that determines the resultant 
average rate of event generation. For known distribu- 
tions, the derivation is straightforward. For arbitrary 
5 distributions, the relationship can be determined empiri- 

cally by selecting a value of the distribution parameter 
and measuring the resulting event generation rate over a 
long time period. This measurement can be performed for a 
variety of values of the event parameter, and thus a cor- 
10 rect parameter value corresponding to the user-selected 

g intensity can be chosen. To select an actual event delay 

P given a user-chosen parameter, a lookup table derived 

f| from the user-supplied probability density function is 

yj - applied to the output of a uniform random generator, 

y 1^ After the trigger process generates an event, the 

algorithm passes tot he parameter selection. An exemplary 
Q embodiment of the parameter selection component of the 

invention is illustrated in the block diagram of FIG. 4. 
O Four different parameters are shown; however, any number 

'^^ 20 or types of parameters is within the scope of the present 

invention. The parameter selectors shown are wave selec- 
tion, pitch distribution, pan {i.e., left-to-right sound 
movement) distribution, and amplitude (i.e., loudness) 
distribution. Each parameter selector is implemented as a 
25 Gaussian distribution with independently selectable mean 

and standard deviation, as well as fixed minimum and 
maximum values. In operation, the user selects desired 
values of the mean, standard deviation, minimum, and 
maximum for each parameter or for a subset of the parame- 
30 ters. Each time an event is generated by the trigger 

process, each parameter selector chooses a random parame- 
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ter value according to its distribution. If the parameter 
value does not fall within the limits set by the fixed 
minimum and maximum, then a new value is chosen according 
to the same distribution until a value inside the preset 
limits is found. Thus each module has an effective dis- 
tribution equivalent to the original distribution with 
the intervals outside the limits set to zero (and renor- 
malized) . the chosen parameter values are then applied to 
the wave chosen by the wave selection process. 

Other distributions (i.e., non-Gaussian) can instead 
be used for the parameter selectors. For example, the 
same distributions as noted above for the trigger process 
can be applied to parameter selection. The user can se- 
lect one of the multiple distributions provided or, al- 
ternatively, specify an arbitrary distribution. As with 
the trigger process, arbitrary user-supplied distribu- 
tions can be implemented by deriving a lookup table from 
the probability density function and applying it to the 
output of a uniform random generator. A different distri- 
bution can be selected for each parameter, or the same 
distribution selected for all parameters. 

In an alternative embodiment, the inputs (mean, 
standard deviation, minimum, maximum) to the parameter 
selector distributions are varied in accordance with the 
variation in the trigger process intensity parameter. 
This is particularly useful in cases where the wave se- 
lection or pitch distribution should shift as the inten- 
sity changes. Returning to the car crash example, high- 
intensity events at the beginning of the car crash should 
be crunch sounds, while lower-intensity events at the end 
of the car crash should be higher-pitched glass breaking 
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sounds. FIG. 5A illustrates one possible implementation. 
A simple linear transformation or lookup table is applied 
tot he intensity envelope to obtain inputs for the pa- 
rameter selectors. The intensity envelope can be used to 
control some or all of the mean, standard deviation, 
minimum, and maximum values input to the parameter selec- 
tor. It can also be used to control some or all of the 
parameter selectors . The user can choose which inputs and 
which parameters depend upon the intensity envelope. The 
transformation applied to the intensity envelope can vary 
for the different inputs and parameters. Alternatively, 
as shown in FIG. 5B, envelope generators create separate 
time envelopes from the intensity envelope. These time 
envelopes are then applied to each parameter or each in- 
put to control the time dependence of the parameter. The 
parameter selector inputs can also depend upon events 
occurring within the game . As with the above example in 
which the trigger intensity changes after a goal is 
scored, the types of waves generated can change after an 
event occurs. This can be implemented by changing the 
minimum and maximum inputs to the wave selection process. 
Similarly, parameter values applied to the selected wave 
can also change in response to a particular game occur- 
rence . 

Finally, the playback engine generates sound accord- 
ing to the selected parameters . The playback engine is 
preferably a wavetable synthesizer such as a DLS 
(downloadable sound) unit generator containing samples 
appropriate to modeling a complex event. In the DLS unit 
generator, sounds from wavetables are downloaded inot 
specific memory locations corresponding to specific pro- 
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gram change nijmbers. Parameter implementation is prefera- 
bly through standard controls within the playback engine. 
For example, in the DLS unit generator, pitch control is 
via a keyNum control, amplitude control is via a velocity 
control, and pan control is via a pan controller. Wave 
selection is via selection of program change, provided 
that each sound sample has a unique program change num- 
ber. In a standard DLS synthesizer, the keyNum-to-pitch 
mapping is hardwired at one semitone-per-keyNum, and thus 
there tends to be an unintended musical sound, especially 
if the waves are strongly pitched. If desired, finer 
pitch control can be implemented using the pitchBend com- 
mand. 

While the present invention has been described in 
the context of a digital wavetable synthesizer, it will 
be apparent to one of ordinary skill in the art that it 
can be applied to any arbitrary sound synthesizer. For 
example, the trigger process can be in communication with 
an analog synthesizer. In general terms, the trigger 
process can trigger any sound element, not necessarily a 
wave, to which appropriate parameters can be applied. 
Thus the parameters are not restricted to the exemplary 
parameters described above. In fact, any relevant parame- 
ter that can be chosen stochastically for each event is 
within the scope of the present invention. Typically, the 
parameters are selected in dependence on the particular 
playback engine chosen. Some other examples of parameters 
are the filtering of waves, amount of reverberation, or 
three-dimensional positioning of waves. 

Note that the invention does not require the appli- 
cation of random distributions to the trigger process and 
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to every parameter. In some cases, it is preferable to 
have some of the algorithm elements be deterministic or 
pseudo- random. For example, rather than being triggered 
by the random trigger process, an event can be triggered 
at constant time intervals or by an occurrence within the 
game. The triggered event still has parameters chosen 
according to the random parameter distributions. Alterna- 
tively, randomly triggered events may have only some of 
their parameters with random distributions, while others 
have values determined by the user or the system. 

A variety of combinations of algorithm elements, as 
well as combinations of algorithm elements with standard 
prior art elements, are possible. In some cases, it is 
desirable to use multiple distributions sequentially 
within the same event, blending smoothly from one distri- 
bution to the next. For example, a gradual shift from the 
sound of rain on leaves to the sound of rain on a metal 
roof can be implemented by morphing from a first parame- 
ter distribution to a second parameter distribution, each 
of which is appropriate for its respective sound. Differ- 
ent intensity envelopes for the trigger process can also 
be used sequentially or simultaneously for different com- 
plex sound events or for different components of a single 
complex sound event. Complex sounds can also be con- 
structed from a combination of sounds triggered by the 
present invention and sounds generated by prior art tech- 
niques. For example, standard sustained background loops 
can be combined with non-repetitive impact sounds gener- 
ated by the method of the present invention. In general, 
aspects of the present invention can be implemented as 
one component of a larger sound environment. 
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A further example is shown in FIG. 6, which illus- 
trates a nested trigger process. Rather than trigger pa- 
rameter selection for a particular event, the top-level 
trigger process instead triggers other trigger processes. 
That is, each low-level process is initiated when an 
event is generated by the top-level process. Of course, 
any number of levels of nested processes can be employed. 
Additionally, all of the variations of single trigger 
processes described above can be applied to each individ- 
ual trigger process in the hierarchical system. Each low- 



C| level process can have a different intensity envelope, 

■'15 parameters, and parameter input values. While not shown 

Q in FIG. 6, a top-level trigger process can trigger both 

Uj sound event and lower-level trigger processes. 

W 15 For example, a battle scene can have complex bomb sounds 

L that trigger subsequent explosion sounds . 

O In summary, the present invention can be used to 

y3 create a variety of sound types, including, but not lim- 

p ited to, ambient background textures, such as forest am- 



biance; foreground impact sounds, such as crashes and 
explosions; and compound textures containing background 
elements as well as foreground enveloped events, such as 
crowd sounds with cheers and boos. As described above, 
the present invention is particularly useful for generat- 
ing statistically-triggered small component sounds that 
can be used either as non-looping ambient backgrounds or 
as components of complex impact foreground sound textures 
or simple one-shot sounds used to unify the base quality 
of the complex impact sounds. 



